Title of dataset: covidestim databases

Contact information: covidestim research team
contact about data / project
Fayette Klaassen - postdoc fklaassen@hsph.harvard.edu
Nicolas Menzies - PI nmenzies@hsph.harvard.edu
Ted Cohen - PI theodore.cohen@yale.edu
Joshua Salomon - PI salomon1@stanford.edu


Structure of dataset:

This dataset hosts two archived databases:

- covidestim.zip:	Compressed .sql export of the database, taken from AWS on February 23, 2024. The estimates in this database start on December 1, 2021, and are based on an underlying weekly model.
- covidestim-legacy-<ID>.zip:	Compressed .sql export of the legacy database from AWS. This database contains all runs of the 'daily model', from the beginning of the pandemic through December 1, 2021, and is based on an underlying daily model. The complete .sql file is large (>250GB), so it was split into pieces of 10GB each, and each piece was zipped individually to a file of ~5GB. To reproduce the full .sql file, unzip the covidestim-legacy-<ID>.zip files and concatenate the pieces, in order, into a single file.
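
The merge step for the legacy database can be sketched in Python as shown below. This is a minimal sketch, not a script used by the project, and it rests on assumptions that should be checked against the downloaded files: that <ID> sorts naturally (for example 1, 2, ..., 10), and that each zip file holds exactly one piece of the .sql file. The function name merge_legacy_pieces and the output path are hypothetical.

```python
import re
import shutil
import zipfile
from pathlib import Path


def natural_key(path):
    """Sort key that orders embedded numbers numerically (2 before 10)."""
    tokens = re.split(r"(\d+)", path.name)
    return [(0, int(t), "") if t.isdigit() else (1, 0, t) for t in tokens]


def merge_legacy_pieces(zip_dir, out_path, pattern="covidestim-legacy-*.zip"):
    """Append the content of every matching zip, in ID order, to out_path.

    Returns the archive names in the order they were merged, so the
    ordering can be inspected before the result is trusted.
    """
    archives = sorted(Path(zip_dir).glob(pattern), key=natural_key)
    if not archives:
        raise FileNotFoundError(f"no files matching {pattern} in {zip_dir}")

    with open(out_path, "wb") as out:
        for archive in archives:
            with zipfile.ZipFile(archive) as zf:
                # Assumption: one piece of the .sql file per archive.
                members = [m for m in zf.infolist() if not m.is_dir()]
                if len(members) != 1:
                    raise ValueError(f"{archive.name}: expected 1 file, found {len(members)}")
                # Stream the piece to disk; the pieces are far too large for memory.
                with zf.open(members[0]) as piece:
                    shutil.copyfileobj(piece, out, length=16 * 1024 * 1024)
    return [a.name for a in archives]
```

A call such as merge_legacy_pieces("downloads", "covidestim-legacy.sql") would write the merged file. The merged file exceeds 250GB, so confirm that enough disk space is available, and review the returned list of names to confirm the merge order before loading the result.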

Additional code related to the database and website organization:
Several GitHub repositories within the github.com/covidestim organization contain relevant code and documentation for the structure of the databases, the API endpoints and the website.

- db:		(private repository) describing the full database structure of the covidestim-prod database, the weekly database
- dbstan:	(under development) repository of code to store Stan output in the database (memory intensive); it was still in the development stage when the project was completed
- api_docs:	description of the public API endpoints of the covidestim-prod database
- covid-dash:	repository of the covidestim.org website interface, which renders interactive visualizations from data accessed through the API. This is where changes would need to be implemented when reinstating the databases to relaunch the website. The 'main/master' branch has the last live version of the website, while the current static version runs on the 'local-setup' branch
- blog:		repository of blog posts, hosted as a subsidiary website of covidestim.org, that discuss noteworthy outcomes and the interpretation of results